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Abstract: We consider random topologies of surfaces generated by cubic interactions. 
Such surfaces arise in various contexts in 2-dimensional quantum gravity and as world- 
sheets of string theory. Our results are most conveniently expressed in terms of a parameter 
h = n/2 + x, where n is the number of interaction vertices and x is the Euler characteristic 
of the surface. Simulations and results for similar models suggest that Ex[/i] = log(3n) + 
$h ! 7 + 0(l/n) and Var[/i] = log(3n) + 7 — 7r 2 /6 + 0(l/n). We prove rigourously that Ex [h] = 
\ogn + 0(1) and Var[/i] = O(logn). We also derive results concerning a number of other 
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characteristics of the topology of these random surfaces. 
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1. Introduction 

Many years ago, Wheeler [1] argued that in a theory of quantum gravity, large fluctu- 
ations in curvature at small distance scales, mandated by the uncertainty principle, would 
give rise to corresponding fluctuations in the topology of spacetime. Thus, though space- 
time at large distance scales appears to have the simple topology of a ball in Euclidean 
space, at small distance scales its topology may be that of a complicated and dynami- 
cally changing space-time foam. Some of the consequences of space-time foam have been 
explored by Hawking [2,3] and others (see for example Carlip [4,5]) in the context of Eu- 
clidean quantum gravity. Although this work has shed light on the qualitative aspects of 
topological fluctuations, it is difficult to obtain quantitative results, such as the probability 
distributions of topological invar ants, in four or even three dimensions in other than very 
special cases. Indeed, even recognizing topological equivalence (diffeomorphism) is unde- 
cidable for four-manifolds (see Markov [6], Boone et al. [7] and, for an overview of this issue 
and its relationship to quantum gravity, Schleich and Witt [8]), and an open problem for 
three-manifolds. On the other hand, two-dimensional topology is sufficiently tractible that 
quantitative results can easily be obtained for a variety of models. Much of this previous 
work has focused on models in which the contributions from the two dimensional topology 
enters into the theory weighted in a prescribed fashion by other fields. In contrast, this 
paper will explore a two-dimensional model giving rise to a probability distribution on the 
topology alone, independent of any geometric or dynamical considerations. 

Hermitian matrix models provide a prominent example in which the contribution of 
two dimensional topology to quantum amplitudes appears in conjunction with that of other 
fields. The general form of the free energy for such a model is given by 



where the integral is over all N x N Hermitian matrices $, and V(<&) is a polynomial in <E>. 
The large- N limit, in which N — > oo with the coefficients of V fixed, has been shown by 
't Hooft [9, 10] to be dominated by the dynamics of the fields on a surface of genus zero (see 
also Brezin et al. [11] and Bessis al. [12]). The double-scaling limit, in which the coupling 
constant in V varies in a prescribed way as N — > oo, includes contributions to the partition 
function from surfaces of all genera (see Brezin and Kazakov [13], Douglas and Shenker 
[14] and Gross and Migdal [15, 16]). One such model is equivalent to Euclidean quantum 
gravity in two dimensions, which corresponds to a probability distribution in which each 
closed compact connected two-dimensional manifold is weighted by the exponential of its 
Euler characteristic (see also the survey of Di Francesco et al. [17]). 
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String theory provides another example of a theory in which contribution of the topol- 
ogy of two-dimensional surfaces to quantum amplitudes enters coupled to their geometry, 
as induced by an embedding in a higher-dimensional space-time. In perturbative string 
theory, interacting strings have higher-genus worldsheets (see Polyakov [18] and Alvarez 
[19], as well as Polchinski [20] (pp. 86-90) and the references therein). In string field theory, 
Witten [21] proposed a cubic interaction for open strings for which these worldsheets are 
two-dimensional surfaces with boundary (see also Horowitz et al. [22]). Note that string 
theories have natural connections to matrix models; for example, topological strings pro- 
vide a natural connection between supersymmetry in four dimensions and matrix models 
(see Ooguri and Vafa [23]). 

There are also a number of approaches that treat topology in a discrete way. In 
particular, Regge [24] introduced Regge calculus, in which a discrete triangulation forms 
the framework underlying a piecewise flat approximation to a continuous geometry. This 
approach has proven to be a useful framework for the study of many issues in classical and 
quantum gravity in two and more dimensions (see the surveys of Williams and Tuckey [25] 
and Williams [26]). A special instance of this approach, dynamical triangulations, has also 
been extensively used to investigate properties of quantum gravity (see also Ambj0rn et 
al. [27]). Interestingly enough, in two dimensions, Weingarten [28], Ambj0rn et al. [29], 
Frohlich [30], David [31, 32] and Kazakov et al. [33] have shown that certain dynamical 
triangulation models have deep connections with string theory. All of these approaches 
have the property that the topology is prescribed a priori, so that only quantities related to 
geometry can be predicted by the theory. However, there are some generalizations of Regge 
calculus that lead to theories that can allow consideration of different topologies. Ponzano 
and Regge [34] introduced a particular discretization of geometry through spin variables 
(elements of SU(2)) associated with the edges of the triangulation. Their extension is 
specific to three dimensions, but work of Penrose [35, 36, 37] and others (see Barrett and 
Crane [38] and Baez [39, 40]) on spin networks has generalized it to higher dimensions 
by associating spin variables (in SU(2) and other groups) with two-dimensional simplices. 
Certain formulations of spin networks allow consideration of the contribution of different 
topologies. However, as in string theory and matrix models, topological and spin variable 
contributions enter together in the computation of physical quantities. 

In an interesting approach closely related to our model, Hartle [41] has formulated a 
two dimensional Regge calculus model in which one sums over topologies as well as geome- 
tries in computing amplitudes, and which treats the topology in a discrete fashion in a form 
separable from its geometry. Hartle then considers the contribution of pseudomanifolds 
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to quantum amplitudes by calculating the probability distribution of pseudomanifolds for 
small triangulations. 

The model explored in this paper shares with string theory a cubic interaction, cor- 
responding to graphs in which each vertex is incident with three edges. Our model is also 
related to a matrix model with N = 1 and V a cubic polynomial. However, unlike these 
examples, our model focusses on the discrete space of topologies of surfaces, ignoring geo- 
metric and other structure. As elucidated in Section 2, our model is entirely discrete, and 
can be formulated combinatorially in terms of graphs or permutations. Our model shares 
with Hartle's model this discrete nature. However, it differs from Hartle's model in three 
aspects: 1) Hartle's model assigns positive probabilities to pseudomanifolds whereas we 
consider only manifolds, 2) Hartle's model assigns equal probabilities to all pseudomani- 
folds with a given number of vertices whereas we assign equal probabilites to all manifolds 
with a given number of triangles, and 3) Hartle's model requires that the pseudomanifolds 
be simplicial whereas we do not require this of our triangulations. These differences allow 
us to examine the probability distribution of our model for large triangulations. 

2. Probabilistic Models 

We shall focus our attention on one particular probability distribution, but there 
are several mathematical models giving rise to this distribution. The simplest of these 
models, the quotient model, produces a random orientable surface as the quotient surface 
of a number of solid triangles. We shall refer to the elements of the boundaries of these 
triangles as corners and arcs, rather than as vertices and edges (since we shall reserve the 
terms vertices and edges for elements in the thin graph model presented below). Take the 
3n arcs of an even number n of oriented triangles and identify them in pairs, respecting 
the orientation, so that the resulting surface is orientable, and with all (Sn — 1)!! = (3n — 
1) ■ (3n — 3) • ■ • 3 ■ 1 pairings being equally likely. (These pairing may identify two arcs from 
the same triangle, or more than one pair of arcs between two triangles.) The surface is 
triangulated with n triangles, and their 3n arcs before identification become 3n/2 arcs after 
identification. Let the random variable h denote the number of corners after identification, 
and let x denote the Euler characteristic of the surface. Then we have x = /i — 3n/2 + n, 
so that h = x + n/2. The distribution of h will be the main object of study in this paper, 
but studying h is equivalent to studying x- We note that, since x is even, h has the same 
parity as n/2. We shall see in Section 3 that with probability 1 — 5/18n + 0(l/n 2 ) the 
surface is connected, and thus consists of a single component. In this case, the genus g 
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of the surface is given by (7 = 1 — x/2 = l + n/4 — h/2. Thus in this case studying h is 
equivalent to studying g. 

A variant of the quotient model is the fat-graph model. Take n triangles as before, but 
instead of identifying their arcs in pairs, join them in pairs with rectangular ribbons, again 
in such a way that the resulting surface is orientable, and again with all (3n — 1)!! pairings 
being equally likely. The resulting surface in this case will have a non-empty boundary, 
comprising one or more boundary cycles. If we shrink each boundary cycle to a point 
(or if we cap off each boundary cycle with an appropriate polygon), we obtain a surface 
topologically equivalent to that produced by the quotient model for the same pairing. 
The points produced by shrinking boundary components correspond to the corners after 
identification in the quotient model, so that the number of boundary components is h. We 
shall exploit this correspondence in Sections 4 and 5 to study h by analyzing an algorithm 
that traces out the boundary cycles in the fat-graph model. 

Another variant of the quotient model is the thin-graph model. In the quotient surface, 
install a vertex corresponding to each triangle and install an edge joining two vertices for 
each pair of arcs of the corresponding triangles that are identified. (Alternatively, one may 
shrink each triangle in the fat graph to a vertex, and shrink each ribbon to an edge. In 
any case, the graph may have "loops" (edges joining vertices to themselves) and "slings" 
(multiple edges joining a single pair of vertices).) The result is a cubic graph (each vertex 
has degree 3), with an additional structure: the three edges incident with each vertex 
have one of two possible cyclic orderings. As was observed by Heffter [42], these cyclic 
orderings allow the quotient surface or fat graph to be reconstructed from the thin graph. 
The thin graph may be regarded as a bubble diagram for a cubic interaction, and it is 
this viewpoint that forms the physical basis for our models. The thin graph, even without 
the cyclic ordering structure, reveals the number of connected components of the random 
surfaces in the quotient and fat-graph models. We shall exploit this fact in Section 3 to 
estimate the probability that these surfaces are connected. 

A final variant of these models is the permutation model. Here we ignore surfaces 
and graphs and simply consider a probability distribution on permutations. Let g be a 
permutation of 3n elements having cycle structure [2 3n / 2 ] (say (12) (34) • ■ • (3n — 1 3n)), and 
let a be a permutation having cycle structure [3 n ] (say (123) (456) • • • (3n — 2 3n — 1 3n)). 
Let 7T be a random permutation, with all (3n)! permutations being equally likely. Then 
7r £7r -1 is a random permutation uniformly distributed over the permutations with cycle 
structure [2 3n / 2 ]. If we think of the cycles of a as corresponding to the triangles of the 
fat graph model and those of tvqtv -1 as corresponding to the ribbons, we see that the 
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cycles of n g n 1 a correspond to the boundary cycles in the fat graph. We shall exploit 
this correspondence in Section 6 to to study the probability that h = 1. 

For n = 2 triangles, there are 5!! = 15 possible pairings, but they fall into just three 
combinatorial equivalence classes. (Two pairings are combinatorially equivalent if they lie 
in the same orbit under the action of the symmetry group that permutes the triangles and 
cyclically permutes their boundary elements (without changing their orientation).) Three 
representative pairings for these equivalence classes are shown in Figure 1; alongside them 
are shown their corresponding thin graphs (or their fat graphs with very small triangles 
and very long and slender ribbons) . The arrows indicate the cyclic orderings of the edges 
at each vertex. These three representatives are in classes of sizes 9, 3 and 3 (from top to 
bottom); the upper two yield spheres (h = 3), while the lowest one yields a torus (h = 1). 
Thus the probability of a sphere is (9 + 3)/15 = 4/5, while the probability of a torus is 
3/15 = 1/5. 

There are two other mathematical models that give rise to different probability dis- 
tributions, but which yield results similar to those we have observed for random sur- 
faces generated by cubic interactions. The first of these is simply a random permu- 
tation of n elements, with all nl permutations being equally likely. Let h! denote the 
number of cycles in such a random permutation. We note that h! may take any value 
from 1 to n. The probability distribution of b! is given by the generating function 
in which Pr[/i' = k] is the coefficient of £ fc . We have i = H n , 

where H n = J2i<k<nl = l°g n + 7 + 0(l/n) and 7 = 0.5772... is Euler's con- 
stant, and (^r^" 1 ))^ = H l ~ H n \ = Ekk^ = + 0(l/n) and 

tv = 3.14159 ... is the circular ratio. These results yield Ex[/i'] = logn + 7 + 0(l/n) and 
Var[/i'] = logn + 7 - tt 2 /6 + 0(l/n). We also note that Pr[h' = 1] = 1/n. 

Another analogous model was introduced by Harer and Zagier [43] . Instead of starting 
with n triangles, each having 3 arcs, they start with a single polygon having 2n arcs, 
and again identify the arcs in pairs to obtain an orientable surface, with all (2n — 1)!! 
pairings being equally likely. Let h" denote the number of equivalence classes of corners 
in the resulting quotient surface (or equivalently the number of boundary cycles in the 
resulting fat graph). We note that h" has the opposite parity from n. They showed that 
the generating function for h" is \ J2i+ m =n+i (2) ( ?+ m _1 ) ^ expressing this generating 
function as a £-fold integral over a Gaussian Hermitian ensemble. (Subseqently, Penner 
[44] derived this result using perturbative series, and Itzykson and Zuber [45] derived in 
two further ways, one using group representations and another expoiting an analogy with 
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the second quantization of the harmonic oscillator.) We have = 1 if / = 1 and 

(-1)7/(7 - 1) otherwise, and (^r(f)) = if / = 1 and 2(-l) z " 1 (H t _ 2 - - 1) 

otherwise. These results yield Ex[/i"] = log(2n) + 7 + 0(l/n) and Var[/i"] = log(2n) + 7 — 
tt 2 /6 + O ((log n)/n). We also note that Pr[/i" = 1] = l/(w + 1). 

We have conducted an empirical study of 10,000 random surfaces, each constructed 
from 80,000 triangles. The sample mean of h was 13.1092 and the sample variance was 
11.3216 .... Since log(240,000) +7 = 12.9656 ... and log(240,000) +7-tt 2 /6 = 11.3206 
these results strongly suggest the conjecture that 



Ex[h] = log(3n) +7 + O 

and 



Var[/i] = log(3ra) + 7 - y + O (J^J 



These conjectures may be compared with the results for the Harer-Zagier model: in each 
case the argument of the logarithm is the total number of arcs of the original polygons, 
and all remaining constant terms are identical. We have not been able to verify these 
conjectures at the stated level of precision, but in Section 4 we shall prove that 

Ex[h] = logn + 0(l) 

and in Section 5 we shall prove that 

Var[/i] = O(logn). 

Since Var[/i]/Ex[/i] 2 — > as n — > 00, this implies that the distribution of h is strongly 
concentrated about its mean. 

In Section 6 we shall study the probability that h = 1. We use a technique based on 
representations of the symmetric group to show that show that 



Pr[h = 1] = 



0, if n/2 even; 

if n/2 odd. 



This result may be compared with that for the Harer-Zagier model: in each case the 
probability, when it does not vanish, is asymptotic to the reciprocal of the number of 
edges in the thin graph. 
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In Section 7 we shall present a classification of the boundary cycles in the fat-graph 
model according to their "self-interactions" , and give a heuristic estimation of the expected 
number of "simple" cycles of various orders. In Sections 8 and 9 we give exact expressions, 
and rigourously derive asymptotic estimates, for the number of simple cycles having the 
two lowest orders in this classification. 

In the remainder of this paper, the variable n will always denote the number of 
triangles, and the random variable h will always denote the number of boundary cycles 
in the fat graph; all other variables may be used with different meanings from section to 
section. Unless otherwise indicated, the notation 0(- ■ •) and 0(- • •) will refer to asymptotic 
behaviour as n tends to infinity through even integers. 

3. The Probability of Connectedness 

Theorem 3.1: Let c be the number of connected components of a random surface obtained 
from the quotient model with n triangles. Then 



which is equivalent to the theorem. We shall work with the thin graph, which has the 
same number of connected components as the quotient surface. 

First we show that 



We may assume n > 4, since since all cubic graphs on 2 vertices are connected. Let V 
be the set of n vertices of the thin graph. Say that a subset P of the vertices of the thin 
graph is closed of no edge of the thin graph joins a vertex in P to one in V \ P. (Thus 
a closed set is one that comprises one or more connected components of the thin graph.) 
The event c > 2 is equivalent to the existence of a closed set P with ^ P ^ V . If P is 
such a closed set, then so is V \ P, and at least one of these sets must contain at most n/2 
vertices. Thus 




Proof: We shall show that 




(3.1) 




(3.2) 




Pr[P closed]. 



#P<n/2 
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A closed set must contain an even number of vertices, and there are ( 2 ™ ) sets containing 
2k vertices. The probability that a given set P containing 2k vertices is closed is 

(6fc-l)!!(3ra-6fc-l)!! 
(3n- 1)!! ' 

since there are (6k — 1)!! ways of constructing a thin graph on P and (3n — 6k — 1)!! ways 
of constructing a thin graph on V \ P. Thus we have 

Pr[c > 2] < F ^ 

l<fc<n/4 

where 

_ (6k- l)!!(3n-6fc- 1)!! f n 

i 1 k — 



(3n-l)!! \2k, 

We shall show that F k is a non-increasing function of k for 1 < k < n/2. The ratio of 
consecutive terms, 

F k+1 (6k + 5)(6fc + l)(n - 2k) 

Pk ~ 



F k (3n-2k-l)(3n-2k-5)(2k + 2)' 
is unity when the difference between its numerator and denominator, 

S k = (6k + 5)(6k + l)(n - 2k) - (3n -2k- l)(3n -2k- 5)(2k + 2), 

vanishes. This cubic polynomial in k has roots at ko = n/4 — 1/2 and 

3n - 6 ± V9n 2 + 36n + 16 



fc± = 



12 



Since 5fe ~ —144k 3 for — > ±oo, S k < and thus < 1 for fc_ < k < ko. Since n > 4, 
we have 144n > 308, which implies that k- < 1. Thus we have F k +i < F k for I < k and 
k + 1 < n/4. This implies 

Pr[c > 2] < Fi + F 2 + (n/2 - 2)F 3 . 

Since Fi = 5/18n + 0(l/n 2 ), F 2 = (9(l/n 2 ) and F 3 = 0(l/n 3 ), this inequality yields (3.2). 
Finally we show that 

F ^ 2 ^Wn + °(^)' (3 ' 3) 
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To do this we focus our attention on closed sets containing 2 vertices. We have 
Pr[c > 2] > J^Pr[P closed] - ^Pr[P closed, Q closed], 

P P,Q 

where the first sum is over all P with j^P = 2, and the second sum is over all unordered 
pairs of distinct sets P and Q with j^P = j^Q = 2. The first sum contains (™) = n 2 /2 + 
0(n) terms, each equal to 5!! (3n-7)!!/(3n- 1)!! = 5/9n 3 + 0(l/n 4 ). For the second term, 
Pr[P closed, Q closed] vanishes unless P and Q are disjoint. Thus the second sum contains 
(£)(W 2 = 0(n 4 ) non-vanishing terms, each equal to 5!! 5!! (3n - 13)!!/(3n - 1)!! = 
0(l/n 6 ). Substituting these results in (4) yields (3). 

Inequalities (3.2) and (3.3) yield (3.1), which completes the proof of the theorem. □ 



4. The Expected Number of Cycles 

Our main result is an estimate for the parameter h, regarded as the number of bound- 
ary cycles in the fat-graph model with n triangles. 

Theorem J^.l: As n tends to infinity through even integers, we have 

Ex[/i] = logn + 0(l). 

Let c be one of the 3n corners of the original n triangles. Let pk denote the probability 
that c lies in a boundary cycle of length k. 

Proposition 4-2: 

Ex[/i]=3n 

l<fc<3n 

Proof: Let hk denote the number of boundary cycles of length k. Then 

Ex[/i]= Ex W 

l<fc<3n 

Let Cfc denote the number of corners in boundary cycles of length k. Then Ck = khk, so 
that 

Ex[/ t ]= -£ ^M. (4.1) 

l<A;<3n 

The probability distribution is invariant under a group of symmetries that includes per- 
mutations of the triangles and cyclic permutations of the corners of each triangle. Since 
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this group acts transitively on the 3n corners, we have Ex[c/j] = 3npk- Substituting this 
into (4.1) yields the proposition. □ 

Our next step is to obtain estimates for pk. This will be done through the following 
two lemmas. 

Lemma 4-3: For 1 < k < n, we have 



We shall prove these two lemmas below. For now, let us see how they combine to 
proove the following proposition. 

Proposition 4-5: For 1 < k < n/2, we have 




Lemma 4-4'- For 1 < k < n/2, we have 





as n tends to infinity through even integers. 
Proof: From Lemma 4.3, we have 




From Lemma 4.4, we have 




Combining these bounds yields the proposition. □ 
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This proposition, together with Proposition 4.2, allows us to prove Theorem 4.1. 

Proof of Theorem J^.l: Since the sum of the lengths of all boundary cycles is 3n, there can 
be at most 5 boundary cycles of length exceeding n/2. Thus Proposition 4.1 yields 

Ex[/i]=3n y + 0(l). 

l<fc<n/2 

Evaluating the sum using Proposition 4.5 yields 

Ex[/ t ]= £ (1 +0 (I)) +0(1) 

l<fc<n/2 V \ / / 

= logn + 0(l), 

since Ei< fc <n/2 i = logn + 0(1). □ 

It remains to prove Lemmas 4.3 and 4.4. To do this, we shall analyze the following 
randomized procedure cycle, which constructs the boundary cycle containing the corner c, 
and returns its length as cycle(c). 

Since we are dealing with orientable surfaces, we may regard them as two-sided, and 
may imagine one side coloured green and the other coloured orange. We shall regard the 
arcs of the triangles as being directed clockwise as seen from the green side. For any corner 
d of a triangle T, let d~ denote the preceding corner and d + the following corner in the 
boundary of T, so that (d, <i + ), (d + , d~) and (d~,d) are the directed arcs in the boundary 
of T. 

Each ribbon installed by the following procedure is a quadrilateral containing four 
corners and four arcs. Two of these arcs come from the triangles joined by the ribbon; 
the other two will be called links. Links will be directed so that the arcs of a ribbon 
are directed counterclockwise as seen from the green side. Links are initially unmarked, 
but may subsequently be marked to indicate that they are part of the boundary cycle 
containing c. 

The procedure uses a data structure called the urn. The urn contains at any time a 
set of corners. This set is initialized by putting the 3n corners into it. A specific corner 
may be removed from the urn, or a random corner may be drawn from the urn; the corner 
removed in this way is equally likely to be any of the corners currently in the urn. 

integer procedure cycle (corner c); 
begin 

corner head, tail, next, point; 
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integer length; 
head := c; 
tail := c; 
length := 0; 

put the 3n corners into an urn; 
repeat 

remove head from the urn; 

next := draw from the urn; 

install a ribbon with arcs (head, next~), (next~ , next), (next, head~) 

and (head~ , head) in counterclockwise order as seen from the green side, 
and introduce (head, next~) and (next, head~) as unmarked links; 

while there is an unmarked link (head, point) do 
begin 

mark the link (head, point); 
length := length + 1 ; 
head : = point 
end; 

while there is an unmarked link (point, tail) do 
begin 

mark the link (point, tail); 
length := length + 1 ; 
tail : = point 
end; 
until head = tail; 
return length 
end 

The repeat . . . until statement is analogous to a while . . . do statement, except that 
the body of the statement is executed before, rather than after, the condition is tested. 

Figure 2 shows the three possibilites for the installation of the first ribbon; unmarked 
links are shown dashed, and marked links are shown bold. The most common case, in 
which the corner d drawn from the urn lies on a different triangle from c, is shown on the 
left; the special cases in which d = c~ and d = c + are shown on the upper right and lower 
right, respectively. 

The probability pk that the corner c is in a boundary cycle of length k is simply the 
probability that the procedure invocation cycle(c) returns the value k. Our estimates for 
Pk will therefore be based on an analysis of this procedure. 

Let us consider the subgraph formed by the corners and the links at some point in 
the execution of the procedure. By the in-degree of a corner we shall mean the number 
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(zero or one) of links directed into it, and by the out- degree, the number (also zero or one) 
directed out of it. This subgraph then comprises one or more components, each of which 
is either a path (a part of a boundary cycle) or a complete boundary cycle. Initially, there 
are 3n paths, each of length zero. As links are introduced, paths may be extended, merged 
or closed into cycles. In a path, the corner with out-degree zero is called the front and the 
corner with in-degree zero is called the rear. At any time, there is one marked path. Its 
front is called the head and its rear is called the tail. The variable length keeps track of 
the length of the marked path. Initially the head and tail coincide at c. When they again 
coincide after one or more ribbons have been installed, the marked path closes into a cycle, 
the procedure terminates by returning the length of this cycle. We observe that the length 
of the marked path increases by at least one at each execution fo the repeat . . . until 
statement, but may increase by more if previously unmarked paths are merged at its head, 
or tail, or both. 

To analyze the behaviour of the procedure, it will be convenient to represent its 
possible executions as a tree. This tree will comprise a number of nodes, each representing 
a state of execution of the procedure, joined by branches, each representing an execution 
of the body of the repeat . . . until statement. One node, called the root of the tree, 
corresponds to the initial state of the procedure just before the first execution of the body 
of the repeat . . . until statement. Other nodes, called the leaves of the tree correspond to 
the final states of the procedure after the last executions of the body of the repeat . . . until 
statement. Each node other than the root has one or more children, corresponding to the 
states reach after vaious corners are drawn from the urn. Each node that is not a leaf has 
a unique parent, corresponding to the immediately preceding state in the execution. The 
ancestors and descendents of a node are defined in the obvious way. 

With each node K we may associate a value depth(K) (corresponding to the number 
of installed ribbons), as well as values head(K), tail(K) and length(K), in the obvious 
way. Since each installation of a ribbon adds either one or two marked links, we have 

depth(K) < length(K) < 2depth(K). 

All nodes at the same depth d correspond to states in which the same number 3n — 2d of 
corners (including the current head) remain in the urn, and thus all of these nodes have 
the same number 3n — 2d — 1 of children. If control reaches some node of the tree, it is 
equally likely to proceed to each of its children. We shall say that a node K is shallow if 
depth(K) < k — 1. If K is shallow, the probability of proceeding to any particular child 
of K is at least l/3n and at most l/(3n — 2k + 1). Finally, we shall call an internal node 
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K (that is, a node other than a leaf) double if head(K)~ = tail(K), and call it single 
otherwise. If a node is double, then in proceeding to one of its children both newly added 
links will be marked, otherwise only one will be marked. (In either case, previously added 
links may also be marked.) 

We are now erady to prove Lemmas 4.3 and 4.4. 

Proof of Lemma 4-3: Let E k denote the event that execution terminates at a leaf L with 
length(L) = k, so that p k = Pi[Ek\. If K is a node other than the root, let K* denote its 
parent. We shall write 

p k = Pr[E k ]=Pr[E 1 k ]+Pr[El} 7 (4.2) 

where E\ (respectively, E%) denotes the event that execution terminates at a leaf L such 
that length(L) = k and L* is single (respectively, double). 

Let us first consider an upper bound for Pr If El occurs at L, we shall call L* 
a precursor for E\. If the node K is a precursor for E^, then (1) K has exactly one child 
at which El occurs (this child corresponds to drawing the corner tail(K) + from the urn 
as next), (2) length(K) = k — 1 (since proceeding to the child at which El occurs will 
add one to length, resulting in length = k), and (3) K is shallow (since installing a ribbon 
increases length by at least one, so that depth(K) < length(K) = k — 1). Letting K also 
denote the event that control reaches the node K, we have 

I K] = — j— - - < 1 



3n - 2depth(K) - 1 ~ 3n - 2k + 1 

Along any path from the root to a leaf in the tree, at most one node can be a precursor 
to El (since length strictly increases along any such path). Thus 

E Fr W < i. 

precursor K 



to El 



The last two bounds together yield 



Pv[El] = J2 Pr ^ I K ^ Fr W 
precursor K 
to El 

< V Pt[K] 

~ 3n - 2k + 1 ^ L J 

precursor K 

to El 
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Next let us consider an upper bound for Pr[£$. At a given node K, we shall say that 
a corner a leads to a corner (3 in I steps if a is the rear and (3 is the front of an unmarked 
path of length / at K. A corner 7 such that 7" leads to 7 in I steps will be called a reflector 
of size /. If £f occurs at a leaf L, so that L* is double, we shall call L* a precursor for 
If if is a precursor for E%, then (1) there exists an / > 1 such that all children 
of K at which E\ occurs correspond to drawing reflectors of size / from the urn, and (2) 
length(K) = k — l — 2 (since proceeding to a child at which E\ occurs will add 1 + 2 to length, 
resulting in length = k), and (3) depth(K) < k — 3 (since depth(K) < length(K) < k — 3). 

If a is a reflector of size / at node K, and if length(K) = k — I — 2, then K will be 
called a precursor for a. Let be the event that corner a is drawn at node K. Then 

Pr[ ^ 1 K] = 3n-2depth(K)-l ~ 3n -2k + 5' (4 ' 4) 

Let RJ denote the event that corner a becomes a reflector at some child of node J. 
For any node J, there is at most one corner a (namely head(J) + ) that can become a 
reflector at a child of J, and if there is such an a, then there is just one child of J (namely 
the one corresponding to drawing the corner (3 from the urn, where (3 is the rear of the 
unmarked path with a as its front) at which a becomes a reflector. Thus, for any node J 
of depth at most k — 1, 

? Pr|ii ° 1 J] £ 3n -2W)-i - rarr- («) 

For E\ to occur, some corner a must become a reflector of size / at a child of some 
shallow node J, and then at some descendant K of J that is a precursor for a, a must be 
drawn from the urn. Thus 

Pr[E 2 k ]= Yl Pr[J}^Pr[R a j\J} Pt[K\J]Pt[S%\K}. 

J shallow a K precursor for a 

K descendant of J 

Using (4.4), we have 

Pr[El] ~ 3n - Ik + 5 £ Pr[J]$>[^M £ Pr[* | J]. 

J shallow a K precursor for a 

K descendant of J 
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If a is a reflector of size /, a node K can be a precursor for a only if length(K) = k — l — 2. 
Since length strictly increases along any path descending from J, at most one node on any 
such path can be a precursor for a. Thus we have 

E F ^ K i J ] ^ L 

.ftT precursor for a 
K descendant of J 

This yields 

P ^]< E Pr[J]E Pr ^l J ]' 



3n-2A; + 5 7 , n 

J shallow 



Using (4.5), we have 



Pr ' E ^ 3„-2 fc + 1 3„-2 fc + 5 £ Pr W' 

J shallow 

Finally, since there are at most k nodes of depth at most k — 1 on any path from the root 
to a leaf in the tree, we have 

E Pr W < k. (4.6) 
J shallow 

This yields 

Pr[El] < ' 1 



3n - 2k + 1 3n - 2/c + 5 
Combining this inequality with (4.3) in (4.2) completes the proof of Lemma 4.3. □ 

Proof of Lemma 4-4 : From (4.2), we have PrfE^] > Pr[E^], so it will suffice to obtain a 
lower bound to Pr[i?^]. Let us say that a node if is a strong precursor to E\ if (1) every 
ancestor of K (including K itself) is single, and (2) there are no reflectors at any ancestor 
of K, and (3) length(K) = k — 1, and (4) K is not a leaf. At a strong precursor K to E\, 
there is exactly one child at which E\ occurs. Thus, for a strong precursor K to E\, 

p A E k I K \ = 3 n _ 2depth(K) - 1 ~ 3^' 

Let Ak be the event that control reaches a strong precursor to E\. Since length strictly 
increases along any path from the root to a leaf in the tree, at most one node on any such 
path can be a strong precursor to E\. Thus 

PrL4 fc ] = E Pl W 

K strong 
precursor to E\ 
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It follows that 



Pr[E k ] > Pr[El] 



K strong 
precursor to E\ 

> I E iM*1 

K strong 
precursor to E\ 

= ± Pr[AJ. (4.7) 

It remains to obtain a lower bound for PrfA^]. 

Let B k denote the event that control reaches a shallow double node K. Let C k denote 
the event that some corner becomes a reflector at a shallow node K. Let F k denote the 
event that the procedure terminates at a leaf L with length(L) less than k. Let G k denote 
the event that control reaches a node J with length(J) at least k, without ever passing 
through a node K with length(K) = k — 1. Then we have 

Pi[A k ] > 1 - Pv[B k ] - Pv[C k ] - Pv[F k ,W k ] ~ Pr[G fc ,B^C£]. (4.8) 

It remains to obtain upper bounds for the four probabilities on the right-hand-side. 

First we deal with Pr[Sfc]. If a node K is double, but K* is single, we shall call 
K* a double precursor. If a node J is a double precursor, then (1) J is single, so that 
head( J)~ ^ tail(J), and (2) J has exactly one child of J that is double (namely the one 
corresponding to drawing the corner head(J)~ from the urn), and (3) head(J) + leads to 
tail( J) + . Let MJ denote the event that at node J, control passes to a double child of J 
by drawing corner a from the urn. If B k occurs, then MJ must occur for some a at some 
shallow node J. Thus 

Pv[B k ]< J2 PAJ]J2 Fr i M j\ J ]' 
J shallow Q 

Each term in the sum over a vanishes except possibly for the one with a = head(J)~ , so 
that we have 

VPr[M? I J] < 1 —— < . 

^ L Jl 1 ~ 3n-2depth(J) -1 ~ 3n-2k + l 
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This yields 

Pl[Btl ^ 3r»-L + l £ Pl 'M- 

J shallow 

Using (4.6), we obtain 

k 

PrfSfcl < , (4.9) 

L J - 3n-2k + 1' V ; 

which is the desired upper bound for Pr 
Next we turn to Pr[Cfc]. We have 

Pr[C k ]< Pr[J]^Pr[^| J]- 

J shallow a 

Using (4.5), we have 

J shallow 

and using (4.6), we obtain 

which is the desired upper bound for Pr[Cfc]. 

Next we deal with Pr[F^, Bk ]. If -Fit occurs but does not, then Ej must occur for 
some 1 < j < k — 1. Thus 

Pr[F fc ,Sl]< ]T Pr[^]. 
i<j<fc-i 

Using (4.3), we obtain 

Pr\F k ,B^} < , (4.11) 

which is the desired upper bound for Pr[Ffc,5fc]. 

Finally, we turn to Pr[Gfe, B^, C k ]. At a given node J, a corner a will be called a 
deflector of size / if ct~ leads to some corner (3 in / steps. Initially all deflectors are of size 
zero. As links are added, deflectors may be extended, merged into marked or unmarked 
paths or destroyed by being closed into unmarked cycles. A deflector a of size / will be 
called an exit from J if (1) length(J) < k — 2, and (2) length(J) + 1 + 1 > k. If G k occurs 
but Bk and Cu do not, then at some shallow node J the corner drawn from the urn must 
be an exit from J. Thus 

Pr[G fc ,B^]< Pr t J ] E Pr [ JQ ! J ]> 

J shallow exit a 

from J 
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where J a denotes the child of J reached by drawing a from the urn at J. Since J is 
shallow, we have 

Pr[.T | J] < ' 



3n - 2k + 1 
This yields 

Pr[G k ,B- k ,C- k ] < — L— J2 ^\J]Xj, 

J shallow 

where Xj denotes the number of exits from J. If tt is a path from the root to a leaf in the 
tree, we shall let 

Y ir = J2 x -j 

shallow J on tt 

denote the number of exits from shallow nodes on tt. Then we have 

J shallow n 

where the sum on the right-hand side is over all paths tt from the root to a leaf in the tree, 
and Pr[-7r] is the probability that control follows the path tt. Thus 

Pv[G k ,B- k ,C~k] < 3 w _2 fc + 1 E Pr W^- (4-12) 

7T 

We shall show that 

< k (4.13) 

for every path tt. To do this, we shall charge each exit from a shallow node J on tt against 
an unmarked link that is added at a shallow node on tt, in such a way that at most one 
exit is charged against any link. Since there are at most k — 1 shallow nodes on tt, each of 
which adds at most one unmarked link, this will prove (4.13). 

Suppose that corner a is an exit from node J on tt. Let s(a, J) denote the size of the 
deflector a at J. Since a is an exit from J, we must have 



length(J) + s(a, J) + 1 > k, 

so that the path from a~ contains at least k — 1 — length(J) > 1 links. We shall charge the 
exit a from J against the (k — 1 — length( J) )-th link (counting from the rear) in the path 
from a~ . It remains to verify that at most one exit from tt is charged against any link. 
Suppose that exit a from node J is the first exit from tt charged against link A. Any exit 
j3 from a later node K on tt that is charged against a link on the path containing a~ will 
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be charged against a link that appears behind A on this path (since length(K) > length(J) 
and any links in the path from a~ will also appear in the path from (3~). This completes 
the proof of (4.13). 

From (4.12) and (4.13) we have 

p ''[e t ,Br,cf]< sr -! FTT $> W . 

Since 

£Pr[7T]<l, 

7T 

we obtain 

Pr[G*,TO < 3^TT' 

which is the desired upper bound for Pr[Gfc, Cfc ]. 

Substituting (4.9), (4.10), (4.11) and (4.14) into (4.8), yields 

4k 

Pr[A k ] > 1 — , 

and substituting this into (4.7) completes the proof of Lemma 4.4. □ 

5. The Variance of the Number of Cycles 

Theorem 5.1: 

Var[/i] = O(logn). 

Proof: The proof is similar to that in Section 4, so we shall merely sketch it. We have 

Vax[h] = Ex[/i 2 ] -Ex[/i] 2 , 

and we have seen that 

Ex[h] = log w + 0(1). 
Thus to prove the theorem it will suffice to show that 

Ex[h 2 ] = (logn) 2 +0(logn). (5.1) 

We have 



Ex[/i 2 ]= Yl Mhk-h 



Kfe<3n Kfc'On 
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where h k denotes the number of cycles of length k. 

Our first step is to reduce the range of the summations over k and k'. Since there are 
at most 5 cycles of length exceeding n/2, we have 

Ex ^ fc ' hk '} - Y Y E ^ hk ' hk '^ + 10Ex M + 25 

l<fc<3n l<fc'<3n l<fc<n/2 l<fc'<n/2 

= Yl Y Mhk ■ h k ,]+0 (log n). 

l<fc<n/2 l<fc'<n/2 

Thus to prove (5.1) it will suffice to show that 

£ Mhk-h k ,] = (logn) 2 + 0(logn). (5.2) 

l<fc<n/2 l<fc'<n/2 

We have 

Pr[c in k, c' in fc'] 



yj yj Ex[ fti .M=EE E ; E 



k • k' 

l<fc<n/2 l<fc'<n/2 c c' l<fc<n/2 l<fc'<n/2 

where the outer sums are over all corners and the event "corner c is in a cycle of length k" 
has been abbreviated "c in k" . 

If c and c' are corners of the same triangle, we shall write c ~ c', otherwise c c'. We 

have 



c c'~c l<fc<n/2 l<fc'<n/2 



Pr[c in /c, c' in k'] 

Pr[c in /c] Pr[c' in k' \ c in k] 



EE E E 



c c'~c l<fc<n/2 l<fc'<n/2 

^ Pr[c in fc] Pr[c' in k' \ c in k] 

c c'~c l<k<n/2 l<fc'<n/2 

^ Pr[c in k] 

c c'~c l<fc<n/2 ^ 

< 3 ^ ^ Pr[c in fc] 

c l<fc<n/2 ^ 

< 3 Ex[/i] 
= O(logn), 
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since Xa<fc'<n/2 P r [ c ' in fc' | c in k] < 1, and for any corner c, there are just 3 corners d 
such that c ~ d. Thus to prove (5.2) it will suffice to show that 

EV^ Prfc in k, d in k'\ , 2 s ,„ nS 

EE E k . k , I = (logn) 2 + 0(logn). (5.3) 

c c'/c l<fc<n/2 l<fc'<n/2 

The probability distribution is invariant under a group of symmetries that includes 
permutations of the triangles and cyclic permutations of the corners of each triangle. Since 
this group acts transitively on the 9n(n — 1) pairs (c, d) of corners such that c / d , we 
have 

v-^ v-^ v-^ Pr[c in k, d in k'] 

2.^1 2-^< 2.^1 2-^< k~k~' 

c c'rjLc l<fc<n/2 l<k'<n/2 

v-^ Pr\c in k, d in fc'l 
= 9n(n-l) £ E fc.fc/ I ' 

l<fc<n/2 l<fc'<n/2 

where (c, c') is an arbitrary pair of corners such that c d . Since and fc' now appear 

symmetrically in the last sum, we have 

Pr\c in k, d in fe'l 
9n(n-l) £ E fe.fe/ 1 

l<fc<n/2 l<fc'<n/2 

^ io / i\ Pr[c in c' in 

<18n(n-l) 2^ 2^ — 



k ■ k' 

l<fc<n/2 k<k'<n/2 

Thus to prove (5.3) it will suffice to show that 

18n(n-l) )^ }^ — ^ ^ ^ = (logn) 2 + 0(logn). (5.4) 

l<fc<n/2 k<k'<n/2 

Now we shall express the event "corner c is in a cycle of length k and corner d 
is in a cycle of length fc"' as the disjoint union to two events: "corner c is in a cycle 
of length k and corner c' is in a disjoint cycle of length fc"', which will be abbrevi- 
ated "c in fc, c' in disjoint fc'", and "corner c is in a cycle of length k and corner d is 
in the same cycle of length fc'", which can occur only if k = k' and which will be ab- 
breviated "c in fc, c' in same k" . We may also refer to the events "c' in disjoint k'" and 

"c' in same k" conditioned on the event "c in k" . We have 
^ ^ Pr[c in fc, c' in A;'] 
18n(n-l) ^ E fcTtf 

l<k<n/2 k<k'<n/2 



v-^ v-^ ? r k in fc, c' in disjoint fe'l 
= 18n(n-l) ^ £ -± L__J 1 

l<k<n/2 k<k'<n/2 

, . v-^ ? r k in fc, c' in same fcl 
+ 18n(n-l) £ ^ 1 

l<fc<n/2 
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For the last sum we have 

, „ / -t \ Pr\c in fe, c' in same fe] 
18n(n-l) — —2 1 

l<k<n/2 

v—> Pr\c in fe] Pr\c' in same fe I c in fe] 
= 18n(n-l) — [ 



k 2 

l<k<n/2 

EPr[c in k] 
— l — - 
k 

l<fc<n/2 

< 2Ex[h] 
= O(logn), 

since Pr[c' in same k | c in k] < k/S(n — 1) (there are at most fe — 1 corners of the cycle 
containing c on different triangles from c, and the probability that a particular corner 
c' of the 3(n — 1) corners on triangles different from c is among them is thus at most 
(k - l)/3(n — 1) < k/3(n - 1)). Thus to prove (5.4) it will suffice to show that 

/ ^ Pr\c in k, c' in disjoint k'} , 9 . . . 

18n(n-l) ]T £ — ^ L__ = (log n) + O (log n) . (5.5) 

l<fc<n/2 k<k'<n/2 

We have 

Pr[c in fe, c' in disjoint fe'] 



18n(n-l) £ E 



fe-fe' 

l<fc<n/2 k<k'<n/2 



sr^ Prlc in fe] „ . . v-^ Prlc' in disjoint fe' I c in fe] 

= 6 " E -V^ 3 *"- 1 ' E -, — ! — 1 

l<k<n/2 k<k'<n/2 



(5.6) 



We can estimate Pr[c' in disjoint fe' | c in fe] the same way as we estimated Pr[c in fe] in 
Lemma 4.3 of the deriviation of Ex[/i]. The only difference is that we start with a situation 
in which up to fe ribbons may already have been installed. Thus we must replace fe by 
k + k' < 2k' in Lemma 4.3: 

If 2k' \ 

Prlc' in disjoint fe' I c in fe] < — ( 1 + — . 

1 J 1 J ~ 3n - 4k' + 1 V 3n - 4k' + 5 / 



23 



Thus for the inner sum in (5.6) we have 

n . v-^ Prlc' in disjoint k! I c in fcl 

3(n_1) Z £ 

k<k'<n/2 

< {n- ) 2^ F ' 3n - 4A;' + 1 V + 3n - 4fc' + 5 J 



- Z ( J./ 



___4 \ f_ 2k' 

\ hi + 3n _ 

fc<fc'<n/2 



+ ^ 7T~, ! 7" I 1 + 



A;' 3n-4/c' + l/ V 3ra-4fc'+5 
= logn — log + O(l). 

Furthermore, we still have 



3n - 2k + 1 V 3n - 2fc + 5 
from Lemma 4.3. Thus for the outer sum in (5.6) we have 

6n Pr[C fc ln fc] (logn-logfc + 0(l)] 

l<fc<n/2 1 V / 

£ 1'3^tt( 1 + 3^ T 6)( 1 ^->°^ + 0( 1 ) 

l<fc<n/2 V 1 / \ 

22 £ G + ra T 5)( 1 + sr3^T5)(^"- lo ^ + °( i : 

l<fc<n/2 V 1 / \ / \ 

= (logn) 2 + O(logra), 

where the main contribution comes from 

log n — log k , 9 . 

2 Z fc =(logn) 2 + 0(logn). 

l<fc<n/2 

This proves (5.5) and thus completes the proof of the theorem. □ 



6. The Probability of a Single Cycle 

We shall be concerned in this section with determining the probability 

Qn/2 = Pr[/i = 1] 



that the fat-graph constructed from n triangles contains a single boundary cycle. 
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Lemma 6.1: We have 

Q2s = 

for s > 0. 

Proof: Since /i = n/2 + x and x is even, h has the same parity as n/2. Thus Pr[/i = 1] = 
when n/2 is even. □ 

Our main result is the following. 

Theorem 6.2: We have 

_ 2 4a+1 3 3s+1 (4s + 2)! (6s + 2)! (6s + 3)! 
92s+1 ~ (s + 1)! (3s + 1)! (12s + 6)! 

for s > 0. 

Proof: Let t = 2s + 1 = n/2. Then if /i = 1, the single cycle has length 6t = 3n. We employ 
a technique attributed by Bessis, Itzykson and Zuber [12] to J. M. Drouffe. This proof is 
uses the irreducible characters % a of the symmetric group Set, indexed by the partitions 
a of 6t (that is, non-decreasing sequences of positive integers summing to 6£). The facts 
we need are found in the book by Sagan [46] . 

Let 

Qt = q t (6£-l)!! 

denote the number of pairings that result in a single cycle of length 6t. Then 

Qt = y^rTl,[23*1<Vr1,[6t1) (6-1) 

T 

where the sum is over permutations r in Set and a is a fixed permutation of congugacy 
class [3 2t ] (that is, the conjugacy class of permutations containing 2t cycles of length 3). 
From the completeness relation for the characters of the symmetric group Set (see Theorem 
1.10.3 in Sagan [46]), we have 

where the sum is over the partitions a of 6t, 4> and ip are permutations, [(f)] denotes the 
congugacy class containing 0, and #[0] denotes its cardinatlity. Substituting this relation 
in (6.1) yields 

Qt = #[ ?2f 2 [6t] Ex a ([2 3t ]) X'm) Y,X a (r) /H. (6.2) 

a,f3 t 
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We shall need the formula 

Ex-w^)= ^y . (6.3) 



This variant of the orthogonality relation for characters can be proved as follows. Let 
{ a i,j}i<i,j<v and {bp,q}i< p ,q< w be the matrix elements of the representations correspond- 
ing to x a and \^ \ respectively. From the proof of Theorem 1.9.3 in Sagan [46], we have 

/ x. / x (6t)!^ a ,/3<?i, g ^,p 

2^ a M( r ) 6 P,«( r ) - X «([l6t]) ' 

since v = x a ([l 6t ]). (Note that aij(r _1 ) = aj 5 j(r), since every element is conjugate to its 
inverse in the symmetric group.) Setting j = i and multiplying by b q ^ p (a) yields 



ai,i(T ) b q:P {a) b P:q {r ) = x a ([l 6f D ' 

Summing over p yields 

^a hl (T)b q:q (ar) = 

T 

summing over q yields 



(6t)lS a ,pS ijq a qi i(a) 



x a {[1 e t]) 



X Q ([1 6 *]) 

and summing over % yields (6.3). Substituting (6.3) in (6.2) yields 

#[2^]#[6t] ^ X Q ([2 3f ])x Q ([6t])x a ([3 2t ]) , . 

Qt ~ (6*)i ^ r([i 6 l) ' ( } 

since a G [3 2t ]. 

To evaluate the sum in (6.4), we observe that 

f (-I)* if a = [6* -p, IP]; 

X°([6t]) = 

[ 0, otherwise. 
(This is Lemma 4.10.3 in Sagan [46].) Thus, setting A(p) = [6£ — p, l p ], we have 

_ #[2«] #[6t] v (-1)"X > ""([2 3 '])X > ""([3 2 ']) 
* («)' o<^e, ^ (P) « 16< ]' ' 
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The denominator x A ( p )([l 6t ]) is the dimension of the representation coresponding to the 
partition X(p): 

x A(p) ([i 6t ]) = ( 6 '; 1 ). 

(This follows from the Hook Formula, Theorem 3.10.2 in Sagan [46].) The characters in 
the numerator are given by 



x Mp)^ 2 st ]) = (_i)p+Lp/2J 

and 



( LP/2J ) 



x x(p H[s 2t ]) = (-i) p+[p/3i ^ /3 1 1 ). 



(These follow from the Murnaghan-Nakayama Rule, Theorem 4.10.2 in Sagan [46].) Sub- 
stituting these relations in (6.5) yields 

# [2*] m H)^ + ^ (g^) (gg) 

(«)' oh, (V) 

Finally, we have #[2 3 *] = (6t - 1)11 and #[6£] = (6t - 1)!, which yields 

Q« = ^ £ Lil (6.6) 

0<p<6t V p / 

The sign pattern in (6.6) has a period of 12; this suggests that we aggregate the terms 
in groups of 6. Thus we make the substitution q = [p/6_|; the result is 



^ 6t 



E c^)* (V) (V) 



0<q<t-l V 6<j / 



4(6g+l) + (6g+l)(6g + 5) 



(6*-6g-l) (6t-6q- l)(6*-6g-5) 
Making the substitution t = 2s + 1, we obtain 



_ (12» + 5)!! 
e2s+1 " 12. + 6 X 



E 



(-1)' (T) (T) 



0<q<2s \ 6q ) 



4(6g+l) (6q + l)(6q + 5) 



(12s-6g + 5) (12s -6g + 5)(12s- 6q + 1) 
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Thus to prove the theorem, we must show that 



T(s)= F (s,Q) = h (6-7) 

0<g<2s 



where 

_ (-1)« (1 + 36q 2 + 4s- 72qs + 24s 2 ) ( 4s + 1 ) ( 6s 3 + 2 ) (s + 1)1 (3s + 1)1 (12s + 5)! 
~ 2 4s 3 3s (1 - 6q + 12s) (5 - 6q + 12s) ( 12s 6 + 5 ) (4s + 2)1 (6s + 2)1 (6s + 3)! 

Define 



G(s,q) = F(s,q)R(s,q), 

where 

t-> / \ Ms,q) 

R(s,q) = — r, 

B(s,q) 

A(s, q) = q (12s - 6q + 5) (12s -6q + l)x 

(36g(1440s 5 + 5328s 4 + 6656s 3 + 2560s 2 - 734s - 527)- 
36g 2 (1488s 4 + 4832s 3 + 5444s 2 + 2370s + 271)- 
1296g 4 (2s + 1) (s + 1) + 10368q 9 (2s + 1) (s + l) 2 - 
13824s 6 - 47232s 5 - 27984s 4 + 76288s 3 + 75909s + 15050) 

and 

B(s, q) = 2 4 3 2 (2s — q + 1) (2s - q + 2) (s + 1) x 

(2s + 3) 2 (6s + 5) (6s + 7) (24s 2 - 72qs + 4s + 36q 2 + 1). 

Then we have 

F(s + l,q)- F(s, q) = G(s, q + 1) — G(s, q). 
Summing this result over all q yields 

T(s + 1)-T(s) = 0, 

since F(s,q) vanishes outside the range of summation in (6.7). Since T(0) = l,we obtain 
(6.7) for all s > by induction. □ 

Corollary 6.3: We have 



Pi[h = 1] 



0, if n/2 even; 

^+0(^), if n/2 odd. 
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Proof: Applying Stirling's asymptotic formula in the form 



nl = 



(2tt?i) 1 /2 



n 



n 



( 




) 



) 



e 



to Theorem 6.2 yields 



This relation together with Lemma 6.1 yields the corollary. □ 

7. The Classification of Cycles 

In this section we shall introduce a classification of boundary cycles in the fat graph 
model, based on their "self-interactions" . In this classification, each cycle is either "simple" 
or "complex", and each simple cycle is of some finite order. We shall give a heuristic 
argument that predicts the expected number of simple cycles of each finite order. In the 
following sections, we shall give rigourous confirmations of these predictions for the two 
lowest orders. 

Let C be a boundary cycle in a fat graph. The self-interaction surface S(C) of C is 
the union of all ribbons traversed by C in both directions (that is, all ribbons with both 
links marked), together with all triangles incident with these ribbons (that is, all triangles 
visited more than once by C). 

If S(C) contains two ribbons incident with a given triangle, then it also contains the 
third ribbon incident with that triangle. Thus every triangle in S(C) is incident with either 
one ribbon or three. Since every ribbon is incident with two triangles, it follows that the 
number of triangles in a connected component of S(C) is even. 

We shall say that a connected component of S(C) is simple if it is simply connected, 
and that it is complex otherwise. A simple cycle will be said to have order b if every 
connected component of S(C) contains at most 2b triangles. Thus a cycle has order zero 
if it traverses no ribbon in both directions (or equivalently, if it visits at most one corner 
of any triangle); it has order one if it visits at most two corners in any triangle. 

The top pairing in Figure 1 yields two simple cycles of order zero and one of order 
one; the middle pairing yields three simple cycles of order zero; and the bottom pairing 
yields complex cycle. 

Let us now consider, in a heuristic way, the expected number of simple cycles of various 
finite orders. Let the random variable Sb denote the number of simple cycles of order at 
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most b. We start with cycles of order zero. A cycle cannot have a doubly traversed ribbon 
unless it has length at least four, and it must have a doubly traversed ribbon if its length 
exceeds n. Thus we expect most short cycles to have order zero, and most long cycles to 
have higher order. Consideration of the "birthday effect" (if there are n days in a year, 
then among n 1 ' 2 people in a room there is a significant probability that two have the same 
birthday) leads one to anticipate that the transition will occur for cycles of length around 
n 1 / 2 . From the results of Section 4, we have that the expected number of cycles of length 
k is about 1/k. Thus we anticipate that Ex[s ] ~ J^iKkKn 1 / 2 \ = ^°S n + 0(1). We 
can refine this estimate in the following way. Although all doubly traversed ribbons are 
excluded from cycles of order zero, we shall focus our attention on "minimal forbidden self- 
interactions" , which are isolated doubly traversed ribbons (that is, components of S(C) 
containing exactly two triangles joined by a single edge). We assume the number f of 
isolated doubly traversed ribbons is approximately Poisson distributed, so that Pr[/ = 
0] exp— Ex[/ ]. We shall see in the next paragraph that we have Ex[/ ] ~ k 2 /6n. This 
leads us to anticipate that 

Ex[s ] « - exp(-/c 2 /6n) 

^— ' k 

l<k<n 

1 7 

~ 2 lo §( 6n ) + 2' 
In Section 8 we shall confirm this prediction in the form 

Ex[ S0 ] = ^log(6n) + | + o(^. 

Let us now justify the approximation Ex[/o] ~ k 2 /6n. We consider how the algorithm 
of Section 4 can doubly traverse a ribbon between the arc (c^~, c\) of triangle t\ and the 
arc (c^~, C2) of triangle £2 (without doubly traversing any other ribbon incident with t\ or 
£2)- This can happen if corner c+ is drawn from the urn at some draw d±, then corner C2 
is drawn at the immediately following draw d\ + 1, and finally corner is drawn at some 
subsequent draw di > d\ + 1. The two triangles t\ and ti and their corners c\ and C2 can 
be chosen in about 9n 2 ways. The two draws d\ and can be chosen in about k 2 /2 ways. 
Each of the three draws occurs with probability about l/3n, for an overall probability of 
about l/27n 3 . The product of these factors must first be multiplied by a factor of 2, since 
the same ribbon is doubly traversed (with the traversals occurring in the opposite order) 
if C2 is drawn at d±, c\ is drawn at d\ + 1 and is drawn at c?2- Finally, we must divide 
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by a factor of 2, since the same ribbon is doubly traversed if c\ of t\ is exchanged with c 2 
of t 2 . The product of all these factors is k 2 /6n. 

We now turn to cycles of order at most one. Here the minimal forbidden self- 
interaction is an isolated triply visited triangle, and reasoning by analogy with the birthday 
effect we anticipate that these will begin to appear when k is about n 2 / 3 . This gives the 
estimate Ex[si] « J2i<k<n 2 / 3 \ = flog 77 - + 0(1). Again we can refine this result by 
estimating the expectation of the number fi of isolated triply visited triangles. In the 
following paragraph we shall argue that Ex[/i] w /c 3 /27n 2 , which leads us to anticipate 
that 

Ex[si] « ^exp(-A; 3 /27n 2 ) 

l<fc<2n 

2 , 2 7 
~ -logn + log3+ — . 

In Section 9 we shall confirm this prediction in the form 

Ex[ Sl ] = ?logn + log3 + | + o((^l). 

Let us now justify the approximation Ex[/i] ss k s /27n 2 . We consider how the algo- 
rithm of Section 4 can triply visit triangle 7j 4 by doubly traversing the ribbon joining its 
arc (04,04) to the arc (c^,ci) of a triangle ti, the ribbon joining its arc (c^,c^) to the 
arc (c 2 , c 2 ) of a triangle t 2 , and the ribbon joining its arc (04, c[ ) to the arc (c^ , C3) of a 
triangle 7:3 (without doubly traversing any of the other ribbons incident with t±, t 2 or £3). 
This can happen if corner cf is drawn from the urn at some draw d\, corner C4 is drawn 
at the immediately following draw d\ + 1, corner c 2 is drawn at the immediately following 
draw di + 2, then corner is drawn at some subsequent draw d 2 > d\ + 2 and corner C3 
is drawn at the immediately following draw d 2 + l, and finally corner is drawn at some 
subsequent draw d% > d 2 + 1. 

The four triangles t±, t 2 , ts and and their corners c±, c 2 , C3 and C4 can be chosen in 
about 81n 4 ways. The three draws d\, d 2 and ^3 can be chosen in about k 3 /6 ways. Each 
of the six draws occurs with probability about l/3n, for an overall probability of about 
l/729n 6 . The product of these factors must first be multiplied by a factor of 6, since the 
same triangle is triply visited (with the visits occurring in a different order) if the triangles 
ti, t 2 and ts are visited in any of the five other permutations of order considered above. 
Finally, we must divide by a factor of 3, since the same triangle is triply visited if c\ of t±, 
c 2 of t 2 and c 3 of t 3 are cyclically permuted. The product of all these factors is k 3 /27n 2 . 
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We now turn to cycles of order at most 6 > 2. If the minimal forbidden self-interactions 
in the fat graph are shrunk to parts of the corresponding thin graph, the results are trees 
with 26 + 2 vertices (6 + 2 leaves and 6 internal vertices) and 26 + 1 edges. These trees are 
cubic plane trees, since every internal vertex has degree 3 and the trees are regarded as 
embedded in the plane, so that two trees are regarded as isomorphic if there is a bijection 
between their sets of vertices that preserves adjacency and also preserves the cyclic order 
of the neighbours around each internal vertex. (For each 6 G {0, 1, 2, 3}, there is just one 
such tree, up to isomorphism; for each 6 > 4 there are more than one.) 

We need to estimate the expectation of the number of occurrences of such minimal 
forbidden self-interactions. Consider a cubic plane tree with 26 vertices. There are about 
(3n) 26+2 ways of choosing 26+2 triangles and one corner from each triangle. The forbidden 
self-interaction will occur if 36 + 3 specific corners are drawn in 6 + 2 series of draws. The 
probability of these corners being drawn is about l/(3n) 3b+3 , and the number of ways 
of choosing the 6 + 2 draws that begin the successive series is about k b+2 /(b + 2)!. We 
must multiply by a factor of (6 + 2)! accounting for the order in which the visits to the 
self-interaction at the draws that begin the successive series occur, and we must divide by 
a factor of 1, 2 or 3 accounting for the automorphisms of the tree. (A plane cubic tree 
can be symmetric about a vertex, with three automorphisms, or symmetric about an edge, 
with two automorphisms, or rigid, with just the trivial automorphism.) Thus we have 



k b + 2 T b 
(3n) 1 



Mh] - (7-1) 



where T b is the sum over all cubic plane trees with 26 + 2 vertices of the weight 1/q for 
each tree with q automorphisms. 

The eight trees corrresponding to minimal forbidden self-interactions for < 6 < 4, 
together with their weights, are shown in Figure 3. The four trees on the left arise for 
< 6 < 3; the four on the right arise for 6 = 4. 

Proposition 7.1: The sum of weights over plane cubic trees containing 26 + 2 vertices is 

1 /26 



(b + l)(b + 2)\b 

Proof: We shall consider rooted trees, in which one vertex is distinguished as the root. 
We begin by considering cubic plane trees that are rooted at a leaf. Let the generating 
function for such trees, in which the number of leaf-rooted cubic plane trees with a vertices 
is the coefficient of x a , be F(x). Such a tree either contains just two adjacent vertices, or 
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it can be constructed by identifying the two roots of two such trees to form the neighbour 
of a new leaf, which is the root of the constructed tree. Thus we have F(x) = x 2 + F{x) 2 , 
which implies 

I - (I- Ax 2 ) 1 / 2 
F{x) = U 3 ] ■ 

Let ^s(x) be the generating function for (unrooted) cubic plane trees with three 
automorphisms. Such a tree can be constructed by identifying the roots of three copies of 
a leaf-rooted cubic plane tree to form an internal vertex (the vertex of symmetry) of the 
constructed tree. Thus we have $3(2;) = F(x 3 )/x 2 , which implies 

^ , . 1 - (1 - 4aW 2 
= 2x~ 2 • 

Let $2(2;) be the generating function for (unrooted) cubic plane trees with two auto- 
morphisms. Such a tree can be constructed by identifying edges incident with the roots of 
two copies of a leaf-rooted cubic plane tree to form an edge (the edge of symmetry) of the 
constructed tree. Thus we have ^(x) = F{x 2 )/x 2 , which implies 

. . . l-u-te 4 ) 1 / 2 

= 2^ ' 

Let G(x) be the generating function for rooted cubic plane trees (where now the root 
may be any vertex, either leaf or internal). Such a tree is either a leaf- rooted cubic plane 
tree, or it can be constructed by identifying the roots of three leaf-rooted cubic plane 
trees to form an internal vertex of the constructed tree. This procedure constructs each 
internally rooted cubic plane tree three times, unless the three trees that are combined are 
isomorphic, in which case the resulting tree, which has three automorphisms, is constructed 
once. Thus we have G(x) = F(x) + F(x) 3 /3x 2 + 2F(x 3 )/3x 2 , which implies 

, 3-(l + 2x 2 )(l-4x 2 ) 1 /2_2(l-4x 6 ) 1 / 2 
° {X) = 6^ • 

Let Hi(x) be the generating function for rooted rigid cubic plane trees (where the 
root may be any vertex, and the trees have only the trivial automorphism). The number 
of rooted cubic plane trees with two automorphisms is given by the generating function 



u ( \ d F ^ 
H 2 (x) = V-, 

dy y 



l-(l-4x 2 ) 1 / 2 
2x 2 (l -Ax 2 ) 1 ' 2 
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and the number of rooted cubic plane trees with three automorphisms is given by the 
generating function 



H 3 (x)=x-F(y) 

Thus we have 

H(x) = G(x) -H 2 (x)-H 3 (x) 

6-(l + 2x 2 )(l-4x 2 ) 1 / 2 



2x 4 



y=x° 



(i-4x G y/ 2 ' 



i 



1 + 2x 6 



6x 2 2x 2 (l-4x 4 ) 3x 2 (l -Ax 6 ) 1 / 2 ' 

Let $>i(x) be the generating function for (unrooted) rigid cubic plane trees. We have 

HMdy 



Or) = I 
Jo 



y 



(1 - Ax 2 Y' 2 + 3(1 - 4x 4 ) 1 /2 + 2(1 - 4x 6 ) 1 /2 _ 6 



12x 2 

The generating function ty(x) for the numbers TJ, is then given by 



= ^i(y) + ^ 2 (y)x + ^ 3 (y) 

(1 - Axf/ 2 - (1 - 6x) 
12x 



y — gl/2 



The proposition now follows using the binomial theorem. □ 

The numbers 6Tb (which are integers) occur as a generalization of the Catalan numbers 
in the work of Gessel [47] , who asked for an enumerative interpretation of them. Such an 
enumerative interpretation is provided by considering drawings of cubic plane trees in 
which every edge is drawn at an angle that is a multiple of 7r/3 from a reference line. 
Cubic plane trees themselves are equivalent to "hexagons", which have been counted by 
Oakley and Wisner [48]. 

Substituting the result of Proposition 7.1 into (7.1) yields 

k b+2 1 (2b^ 



(3n) b + 1 (b+l)(b + 2)\b 



Thus we anticipate that 



EX[Sl1 " JL * ^ V (SO 1 * ' (»+D(»+ 2) U 1 1 
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We note that 

as b — > oo. This suggests the conjecture that the number of complex cycles is asymptotic 
to log4 = 1.386.... 



8. Cycles of Order Zero 

A simple cycle of order zero has length at most n (since each of n triangles is visited 
at most once) , and for 1 < k < n the expected number of such cycles of length k is 

1 3 k nl (3ra-2fc-l)!! 
k ' (n-k)\ (3n- 1)!! 

(The middle factor counts the ways of choosing an ordered list of k corners, no two on the 
same triangle. The last factor gives the probability of connecting these corners into a cycle 
in the order given. This counts each cycle k times, hence the first factor.) The expected 
number sq of simple cycles of order zero is thus 

1 3 fc n! (3ra-2fc-l)!! 

Ex[s ]= X 



Theorem 8.1: We have 



Proof: If we define 



V k (n-k)\ (3n-l)H 

Kk<n 



Ex[ S ] = ilog(6n) + ! + o(^ 



II (3n-3j' + 3) 

l<j<k 

II (3n-2j + l) 

l<j<k 

3-1 



n (i- 

i<j<fc 



n 



n i 

l<j<k v 



2J-1 

3n 



we can write 



k 

Kk<n 
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For the numerator of F k we have 



II (i-^)=«pEu*(i-i +0 g) 

l<j<k v 7 l<j<A: v v 7 

H +o (^) +o 

l<J<fc v v 7 

/c 2 ^ /AA _ z 7 /c 3 

r 

Similar estimation of the denominator yields 



i<i<fc 

Thus we have 



Ffc = exp( __ + o UoU 



Define 

/ = [(Gnlogn) 1 / 2 ]. 

Then we have 

Since F fc is a decreasing function of k, we have 



Kk<n y l<k<n J 



-Oil 



= o 



1 



n i/ 3 y • 

Thus to prove the theorem it will suffice to show that 



l<fc<Z 

For k < I we have 



/c 2 ^ / (logn) 3 / 2 



6n \ n 1 / 2 
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so that 

l<fc<Z V V 77 1<£;<Z v 7 

Thus to prove (8.1) it will suffice to show that 
Define 

TO = [(6n) 1/3 ]. 

Then for k < m we have 

F - = e *p ° ( J73) 
= 1 + °(^)- 

This yields 

£ r exp (-£) = ( 1 + G^)) £ s 

l<fc<m V 7 V V 77 l<k<m 

=( i+o (^))G iog(6n)+7+o (^))' 

where 7 = 0.5772 ... is Euler's constant. Thus to prove (8.2) it will suffice to show that 



We have 




since we may bound the difference between a sum and an integral by the total variation of 
the integrand. The substitution z = (Gny) 1 / 2 yields 

J m z \ 6nJ 2 i m2/6n y 
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Since 

//-7<*» V 



= o 



logn 

n 
;2 



we may raise the upper bound of the integral from I /6n to oo, and thus obtain an expres- 
sion in terms of the exponential integral: 

1 f l2/6Ul t \ j 1 r 1 / ^ J ^f l °Sn\ 
- - • exp(-y) dy = - - • exp(-y) dy + O [ 

where 



6n J \ n 



f°° 1 

— Ei(— x) = / — • exp(— w;) <iw 

Jx W 

(see Lebedev [49], §1.3). Using the asymptotic expansion 

-Ei(-x) = log - - 7 + O(x), 

we obtain 

1 / m 2 \ 1, 6n 7 ^ / m 2 \ 

-2 E '(-ftrj = 2 l08 ^-| +0 UJ 

1 7/1 
= - log(6n) - 77 + O ' 



6 taV y 2 V^ 1/3 , 
Combining these results yields (8.3), and thus completes the proof of the theorem. □ 



9. Cycles of Order One 

We begin by deriving an exact formula for the expected number of simple cycles of 
order one. 

Proposition 9.1: We have 



exn= E E 



l<fc<2n 0<j<fc/4 



k-2j 
2j 



+ 



k-2j-l 
2j-l 



(2j-l)!!x 



3 k - 2j n\ (3n-k + j-l)\ 



(n-k + 2j)\ 



(3n- 1)!! 



(9.1) 



Proof: If a cycle visits each of the n triangles at most twice, it can have length at most 2n. 
Furthermore, if a cycle visits each triangle at most twice, then each traversal of a doubly 
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traversed ribbon must be immediately followed by the traversal of a singly traversed ribbon, 
and thus such a cycle of length k can have at most fc/4 doubly traversed ribbons. Thus it 
remains to show that the summand in (9.1) is the expected number of cycles of length k 
with j doubly traversed ribbons. 

Consider a cyclic directed graph C = (V, E) with vertices V = {vi, . . . , Vk} and edges 
E = {(vi, V2), ■ ■ ■ , (vk-i,Vk), {vk, vi)}. A template T = (I, J) comprises a set I C E of 2j 
edges of C, subject to the condition that no two edges in I are consecutive, together with 
a partition J of I into j pairs of edges. The number of templates is 



partition these edges to form the j pairs in J. (We agree that (—1)!! = 1, as a special case 
of the formula (2j - 1)!! = (2j)!/2^ j!.) 



The cyclic group of order k acts on C in an obvious way, and this induces an action 
on the set of templates. Let p(T) denote the number of templates in the orbit of T under 
this action, and let q(T) denote the order of the automorphism group of T. Then we have 



Next consider a boundary cycle in the fat graph with length k and j doubly traversed 
ribbons. We shall construct a template as follows. Pick a corner c\ on the cycle, and 
then define C2, • • • , Ck to be the the successive corners of the cycle. For each ribbon that is 
doubly traversed by the links (c/, c^+i) and (c g , c g +i) of the cycle, put the edges (yf,Vf+i) 
and (v g ,Vg+i) into I and put the pair {(vf,Vf+i), (v g ,v g+ i)} into J. Different templates 
can be obtained from the same boundary cycle by different choices of the corner c\ \ p(T) 
different templates are obtained in this way, since if we advance the choice of c\ by p(T) 
corners we obtain the same template. 

Now consider the boundary cycles corresponding to a given template T. The cycle 
visits k corners on k — 2j triangles, but once the first visited corner on a triangle is chosen, 
the remaining visited corner on that triangle is determined. The triangles can be chosen 
in n\/(n — k + 2j)\ ways, and the first visited corner on each triangle can be chosen in 
3 fc-2 -? ways. Different choices of triangles and corners can produce the same boundary 
cycle; each boundary cycle is produced q(T) times, since if we advance the choices of 
vertices k/q(T) = p(T) positions we produce the same cycle. For such a cycle to occur 
in the fat graph, k — j specific draws must occur, corresponding to the first traversals 





) ways of choosing the 2j edges in I (there are {2^) wa Y s 
and ( fc 2 " 2 -]" 1 ) wa y s that include it), and (2j — 1)!! ways to 



p(T)q(T) = k. 
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of the k — j ribbons traversed by the cycle. The probability of these draws occurring is 
(3n-/c + j-l)!!/(3n-l)!!. 

Finally, the expected number of boundary cycles of length k with j doubly traversed 
ribbons is 

^ 1 3 k ~^nl (3n-k + j-l)U 

^rtT)'q{T) ( n -k + 2j)\ (3n - 1)!! ' 

where the sum is over all templates T. Since p(T) q(T) = k, we obtain the summand in 
(9.1). □ 

In principle, the asymptotic behaviour of Ex[si] can be obtained by direct analysis 
(9.1). We have found it most convenient, however, to bound the terms with the smallest 
and largest values of k through separate arguments, and thus to analyze (9.1) only for a 
set of intermediate values that includes the transition (around n 2 / 3 ) from significant terms 
to negligible ones. We give these separate arguments in the following two propositions. 

Proposition 9.2: Let m = n 5 / 8 . The expected number of cycles of order at most one and 
length at most m is 

log m + 7 + O 



1 



n 



1/8 



Proof: From the estimates in the proof of Theorem 4.1, we have that the expected number 
of cycles of length at most m is 

£ G + Q))= log ™^ + (^7s)- 

l<k<m V \ / / \ / 

The obstruction to a cycle having order at most one is a triply visited triangle. Thus it will 
suffice to show that the expected number of cycles of length at most m that contain a triply 
visited triangle is (^(l/n 1 / 8 ). This expectation is in turn at most the expected number of 
triply visited triangles that are created during the first m iterations of the algorithm of 
Section 4. 

There are three ways in which a triply visited triangle can occur: the three ribbons 
incident with it can be incident with three other triangles, or with just two, or with just 
one. The expected number of triply visited triangles for which the ribbons are incident 
with three other triangles was dealt with heuristically in Section 6. This argument can be 
made rigourous, at the cost of lowering our sights to an upper bound that holds within a 
constant factor. There are 0(n 4 ) ways of choosing corners ci, C2, cs and C4 of triangles 
£1, £2, ts and £4; there are 0(m 3 ) ways of choosing three draws d\, di and c?3; and the 
probability that the corners drawn at draws di, d\ + 1, d\ + 2, efe, di + 1 and c?3 create 



40 



a triply visited triangle is 0(l/n 6 ). Thus this expectation is 0(m s /n 2 ) = 0{l/n l / s ). 
A similar argument shows that the case in which the three ribbons incident with the 
triply visited triangle are incident with two other triangles gives a smaller contribution, 
0(n 4 k 2 /n 6 ) = 0(m 2 /n 2 ) = 0(l/n 3/4 ). Finally, the case in which the three ribbons are 
all incident with one other triangle (which can happen only if k = 6) gives a still smaller 
contribution, 0(n 2 /n 3 ) = 0(l/n). □ 

For the next proposition, we shall need the following lemma. 

Lemma 9.3: Let X denote the number of successes among h trials that each succeed 
independently with probability at most p. Then 

PrLY > 2hp] < (e/4) hp . 

If on the other hand the trials succeed with probability at least q, then 

PrLY < hq/2] < (2/e) hq/2 . 

Proof: For the first inequality we may assume that the trials succeed with probability 
exactly p, since the resulting random variable is majorized by X. We may also assume 
that p < 1/2, since otherwise PrLY > 2hp] = 0. If 

( 0, if X < 2hp, 
\ 1, if X > 2hp, 

then PrLY > 2hp] = Ex[Y]. If Z = T x ~ 2hp (where T > 1 is a parameter to be chosen 
later), then Y < Z and so Ex[F] < Ex[Z]. Thus it will suffice to estimate Ex[Z]. Since 
X is the sum of h independent random variables that assume the value 1 with probability 
p and the value with probability 1 — p, T x is the product of h independent random 
variables that have expected value pT + 1 — p. Thus 

Ex[Z] = (pT + l-p) h T- 2hp . 

Choosing T = 2(1— p) / (1 — 2p) and using the inequality 1+x < e x yields the first inequality 
of the lemma. A similar argument yields the second inequality. □ 

Lemma 9.3 is an instance of Chernoff 's inequality. 

Proposition 9.4-' Let / = \n 2 ^ 3 log n] . Then the expected number of cycles of order at most 
one with length exceeding 12/ is 0(l/n). 
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Proof: From the proof of Theorem 4.1, we have that the expected number of cycles of 
order at most one with length exceeding 12/ is 

^ Pr[co in a /c-cycle of order at most one] 



k>12l 

< (n/4l) Pr[co in a cycle of order at most one and length exceeding 12/] 
Thus it will suffice to show that 



This probability is at most the probability that the algorithm of Section 4 runs for 6/ 
iterations without creating a triply visited triangle. 

To estimate this probabilty, we shall use an analysis that assigns colours to the corners 
in the urn during the execution of the algorithm. Corners will initially be white, but may 
subsequently be recoloured red or blue. The rules for colouring will be such that if a blue 
corner is drawn from the urn during the first 6/ iterations, then either the cycle containing 
Co has length less than 12/ or it contains a triply visited triangle. Thus it will suffice to 
show that the probability that no blue corner is drawn during the first 6/ iterations is 



The rules for colouring are as follows. Initially, all corners are white except for Cq~, 
which is red, and Cq , which is blue. If ever a blue corner is drawn, we stop the analysis. 
Whenever a white corner c is drawn, then if the immediately preceding corner drawn was 
white, we colour c + red, but if the immediately preceding corner drawn was red, we colour 
c + blue. It is easy to verify that drawing a blue corner either closes the cycle (which then 
has length at most 12/) or creates a triply visited triangle. (It is possible to create a triply 
visited triangle without drawing a blue corner; the rules given have been chosen to be as 
simple as possible while yielding the desired upper bound.) 

The rules specify how corners in the urn become coloured. There are two ways in 
which coloured corners can leave the urn: they can be drawn or they can be removed from 
the urn after becoming the head of the marked path. But at any given iteration, a corner c 
can be removed in this way only if one particular corner d (such that d~ is the rear of the 
unmarked path whose front is c) is drawn. Thus the probability of removing a coloured 
corner from the urn is never greater than the probability of drawing it. 

We shall now obtain estimates for the numbers of corners of various colours in the urn 
at various times during the first 6/ iterations. We shall divide these iterations into three 
phases (phases I, II and III), each comprising 2/ iterations. 




Pr[co in a cycle of order at most one at length at least 12/] = O 




0(l/n). 
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At any time during these 6/ iterations, at most 12/ corners have been drawn or removed 
from the urn, and thus the urn always contains at most 3n and at least 3n — 12/ > 3n/2 
corners. Since at most one corner is coloured per iteration, there are always at least 
(3n — 12/) — 6/ = 3n — 18/ white corners in the urn, and thus the probability of drawing a 
white corner is always at least (3n — 18/)/3n > 1/2. 

We start by considering the number of corners coloured red during phase I. A corner 
is coloured red whenever two consecutive draws yield white corners. For each of / disjoint 
pairs of consecutive draws, this probability is at least (1/2) (1/2) = 1/4. Using Lemma 9.3 
with h = I and q = 1/4, we have that, except with probability at most (2/e) Z//8 , at least 
Z/8 corners are coloured red during phase I. 

Next we consider the number of red corners drawn or removed during phases I and 
II. During these 4/ iterations, there are never more than 4/ red corners in the urn. Thus 
the probability that a red corner is drawn or removed in any iteration is at most 8//3n. 
Using Lemma 9.3 with h = 41 and p = 8//3n, we have that, except with probability at 
most (e/4) 32 ' / 3n , at most 64/ 2 /3n red corners are drawn or removed during phases I and 
II. Thus, except with probability at most 

(2/e)*/ 8 + (e/4) 32 * 2 / 3 ™, 

there are at least 1/8 — 64/ 2 /3n > Z/16 red corners in the urn throughout phase II. 

Next we consider the number of corners coloured blue during phase II. A corner is 
coloured blue whenever two consecutive draws yield a red corner followed by a white 
corner. For each of / disjoint pairs of consecutive draws, this probability is at least 
( (Z/16) /3n) (1/2) = Z/96n. Using Lemma 9.3 with h = I and q = Z/96n, we have that, 
except with probability at most (2/e) z2 / 192n , at least Z 2 /192n corners are coloured blue 
during phase II. 

We shall also need an upper bound, holding with high probability, for the number 
of corners coloured blue during phases I, II and III. For each of 6Z draws, the probability 
of colouring a corner blue is at most (6Z/(3n/2)) = 4Z/n. Using Lemma 9.3 with h = 61 
and p = 4Z/n, we have that, except with probability at most (e/4) 24 ^/ n , at most 48Z 2 /n 
corners are coloured blue during phases I, II and III. 

Next we consider the number of blue corners drawn or removed during phases I, II 
and III. Assuming that there are never more than 48Z 2 /n blue corners in the urn during 
these 6Z iterations, we have that the probability that a blue corner is drawn or removed 
in any iteration is at most (48Z 2 /n)/(3n/2) = 32Z 2 /n 2 . Using Lemma 9.3 with h = 61 
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and p = 32/ 2 /n 2 , we have that, except with probability at most (e/4) 192z3 / n2 , at most 
3841 s /n 2 blue corners are drawn or removed during phases I, II and III. Thus, except with 
probability at most 

( 2 /e)'/ 8 + (e/4) 32 ' 2 /3- + (2/ e f/ 192n + {e/4) 2A[2 ' n + (e/4) 192 ' 3 /" 2 , 

there are at least Z 2 /192n — 384/ 3 /n 2 > / 2 /384n blue corners in the urn throughout phase 
III. 

Finally we consider the probability that no blue corner is drawn during phase III. 
Assuming that there are at least / 2 /384n blue corners in the urn throughout phase 
III, we have that the probability of drawing a blue corner at each iteration is at least 
(7 2 /384n)/3n = / 2 /1152n 2 , and thus that the probability of not drawing a blue corner in 
any of these 21 iterations is at most (1 — / 2 /1152n 2 ) 2 ' < (1/e)' 3 / 576 " 2 . Thus, except with 
probability at most 

(2/e)'/ 8 + (e/4) 32 ' 2 / 3 ™ + (2/e)' 2 / 192n + (e/4) 24 ^ + (e/4) 384 ^ 2 + (1/ef / 576n \ 

either the cycle closes or a triply visited triangle is created before the length of the cycle 
reaches 12/. These six contributions are all 



exp-0((logn) 3 )=o(^. 



□ 

Theorem 9.5: The expected number of cycles of order at most one is 

ExM = ^logn + log3 + ^ + 0^ (1 ° gn)7 



3 to to 3 V ™ 1/8 
Proof: Using Propositions 9.1, 9.22 and 9.4, it will suffice to show that 



12 k 

m<k<12l 0<j<k/4 



k - 2j\ (k - 2j - 1 



2j J V 2J-1 



(2j - 1)!! x 



3 fc " 2j n! (3n-k + j-l)U 1, 27n 2 7 ^ /(logn) 7 ', 

= o lo S ~Z?T - o + °\ —T7T- » ( 2 ) 



(n-k + 2j)\ (3n-l)!! 3 ta m 3 3 \ n 1/8 

where m = n 5 / 8 and / = |~n 2 / 3 log n\ . To do this, we shall use a "bootstrapping" technique, 
whereby crude estimates are used to reduce the range of the summation over j, so that 
more delicate estimates will be applicable over the reduced range. Specifically, we shall 
show that for certain terms the inner summand in (9.2) is (9(l/n 3 ). Since there are 0(n 2 ) 
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such terms, this will imply that the total contribution of these terms is 0(l/n), and thus 
can be neglected. 

First, we shall show that all term with j > 6a, where 



a 



Jl. 

6n' 



may be neglected. Using the estimates 



k - 2j\ (k - 2j - 1\ (k- 2j\ 2 k 2j 

23 ) + \ 2j-i )- v 2j ; - m ' 



(2;-D!! = §f <(2i)! (£) , 



3 fc ~^ nl 
(n - k + 2j)\ 



< (3n) 



and 



(3n-fc+j-l)!! 1 fk\ 

(Sn - 1)!! " (3n)^ 6XP V3n + W \n 2 



< 



1 



~ (3n) fc -J 

we have that the inner summand in (9.2) is at most 



j— exp (2a + 0((logn) 3 )), 



(9.3) 
(9.4) 



(9.5) 



en 



exp(2a + 0((logn) 3 )) 



This is a decreasing function of j for j > a, and for j = 6a it is equal to 

6a 



,4/3 

IT 



expO ((logn) 3 ) =exp (-fi(?i 1/4 ) +0 ((1 ognj II, 



since k > m = fi(n 5 / 8 ). Thus all terms with j > 6a are 0(l/n 3 ) and may therefore be 
neglected. Thus it will suffice to show that 



k ^ 
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Next, we shall show that all terms with j < a — d or j > a + d, where 

d = a 1 / 2 (\ogn) 2 , 

may be neglected. Using the estimates (9.3), (9.4), 

3 k ~ 2j nl Wo u _ 2? - / (k-2j) 2 /fc\ ^(k 3 
(n-k + 2j V . * ^ 3 6XP (—2^- + °{n) + °{^ 



k - 2 * exp(-^ + ^ + 0(-)+0 

2n n \ n j \n 



< (3n 

< (3n) k ~ 2j exp (-3a + ((logn) 3 )) 



fc 3 

2 



(where we have used j < 6a) and (9.5), we have that the inner summand in (9.6) is at 
most 



ea 
J 



< exp 



exp (-a + O ((logn) 3 )) = exp ( a ( - - - log - - 1 ) + O ((logn) 3 ) 

a a a 

2 



(j - a Y 
6a 



+ 0((logn) 3 ) , 



since x — xlogx — 1 < — (x — 1) 2 /6 for < x < 6 and j < 6a. Thus if j < a — d or j > a + d, 
we have that the inner summand in (9.6) is at most 

exp (-^- + O ((logn) 3 )) = exp (-fi((logn) 4 ) + O ((logn) 3 )) . 

Thus all terms with j < a — d or j > a + d are 0(l/n s ) and may therefore be neglected. 
Thus it will suffice to show that 
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Finally, we have the estimates 
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(where we have used j = a + 0(d) and k = 0(1)), 



w-i)" = §f 



2irj J \2j J \j 



1 V' 2 / e V / (log??.'' 2 



(where we have used j = a + 0(d) and = O(m)), 
3 fc " 2 ^ n! 



(n-/c + 2j)! 



= ( 3„)^exp(-^ + BM_il + g 



2n 6n 2 V ™ 1/6 
(where we have used j = a + 0(d) and k = 0(1)) and 
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(where we have used j = a + 0(<f) and = 0(1)). Thus the inner summand in (9.7) 
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since x — x logx-1 = (x-l) 2 /2 + 0(|x-l| 3 ), j = a + 0(d) and k > m = fi(n 5 / 8 ). Thus 
the inner sum in (9.7) is 

1 \ 1/2 / k 3 ^/(logn) 6 \\ (j'-a) 2 



2naJ eXP r^ + °(^7^Jj ^ 6XP 2a 

a—d<j<a+d 



We have 



\- (j ~ a) 2 \- * 2 

2^ ex P ^— = 2^ ex P-— 



2a ^ r 2a 

a—d<j<a+d —d<i<d 



f d e 



de+o(i), 



since the error in estimating a sum by the corresponding integral is at most the total 
variation of the summand. Furthermore, we have 

exp -—d£ = a 1/2 / exp — - drj 

-d Za J-d/a 1 / 2 1 

/°° n 2 
exp — - dr] + exp -0((logn) 4 ) 
-oo ^ 

= (27ra) 1/2 + exp-0((log?i) 4 ), 

since J*_^ exp — ^ dry = J^exp—^dt] < ^ exp — d/a 1 / 2 = (logn) 2 and 
exp d?7 = (27T) 1 / 2 . Thus the inner sum in (9.7) is 

( k 3 ^ ( (logn^ 6 
exp I -— -o+O' 



27n 2 V n 1 / 8 
Summing this result over m < k < 121 yields 



^/(logn) 6 \ ^ 1 k 3 



m<k<12l 

Evaluating this sum in the same way as the one in Section 8, we have 
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— exp — 77 drj + O 

m :i /27n 2 ?7 



n 5/8 



1 , 27n 2 7 / 1 
log — - ^ + O ' 



3 m 3 3 V nV8 J ' 
48 



since J^ 3 - exp — r\ dr\ = log ^ — 7 + 0(a) for a — > and J*^° - exp —77 drj < \ exp — /5 for 



/3 — > 00. Substituting these results into (9.8) yields (9.7) □ 
10. Conclusion 

The principal problem left open by the present work is to determine completely the 
probability distribution for the random variable h. If, for example, this were done by giving 
the generating function for h, one could presumably obtain the moments by differentiation 
(as was done for b! and h" in Section 2), and thus confirm or refute the conjectured 
asymptotics of Ex[/i] and Var[/i]. The most promising approaches to this problem appear 
to lie in the connection with matrix models. An analogous model with a quartic interaction 
has been described by Bessis, Itzykson and Zuber [12], who relate the coefficients in a 
conjectured asymptotic expansion to the enumeration of certain graphs according to their 
genera. The existence of this asymptotic expansion and the interpretation of the coefficients 
has been rigourously established by Ercolini and McLaughlin [50]. Their approach does 
not, however, appear to provide a derivation of the results of this paper, let alone our more 
precise conjectures. Another approach would be to extend Harer and Zagier's [43] analysis 
of glueings of the (2n)-gon to n triangles (see also Penner [44] and Itzykson and Zuber 
[45]). This approach leads naturally to the £-fold integral 



If this integral could be evaluated as an analytic function of £, the result would be (apart 
from an easily calculable normalizing factor) the desired generating function for h. 

Another open problem is to determine the asymptotic behaviour of the expected 
number of complex boundary cycles in the fat-graph model. The heuristic analysis of 
Section 7 suggests that this number tends to a constant, log 4, but we cannot rigourously 
exclude either that it grows unboundedly or tends to zero. Yet another problem is to 
analyze the effect of dropping that requirement that all surfaces be orientable; this could 
be done by allowing each glueing to occur in either of two ways with equal probability. 

Finally, our model assigns equal probabilities to all possible glueings of the triangles, 
with the result that the most likely outcome is a connected surface of high genus. It would 
be of interest to explore the consequences of departing from this assumption. One could, 
for example, associate a self-interaction energy with cycles that doubly traverse ribbons. 
Such an interaction that penalized double traversals of ribbons would presumably tend to 
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increase the total Euler characteristic, increasing the number of components and decreasing 
their genera, with results that would be more similar to matrix models in the large- N limit. 
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